Vector-to-Image Transformation of Character Patterns for On-line and Off-line Recognition

نویسندگان

  • Ondrej Velek
  • Masaki Nakagawa
  • Cheng-Lin Liu
چکیده

This paper proposes a method to generate realistic character images from on-line patterns. From the pen trajectory of an on-line pattern, the proposed method generates images of various stroke shapes using four painting modes: constant line mode, proportional mode and two calligraphic modes. Particularly, the calligraphic modes combine the pen trajectory with real stroke-shape images so that the generated images resemble the characters produced with brush pen. In the calligraphic painting mode based on primitive stroke identification (PSI), the strokes of on-line patterns are classified into different classes and each class of strokes is painted with the corresponding stroke shape template; while in the calligraphic painting mode based on stroke component classification (SCC), each stroke is decomposed into ending, bending and connecting parts, and each part is painted with a stroke shape template. Decomposing strokes into parts is helpful to deal with connected strokes in cursive writing. Our method of image transformation serves two purposes: supplying image samples for off-line recognition, and application of off-line recognition methods to on-line recognition. The experimental results show that our methods are well suited for both purposes. We show that calligraphic painting mode is appropriate for offline recognition while constant line mode and proportional mode are appropriate for on-line recognition.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

رفع اعوجاج هندسی متون به‌کمک اطلاعات هندسی خطوط متن

Document images produced by scanners or digital cameras usually have photometric and geometric distortions. If either of these effects distorts document, recognition of words from such a document image using OCR is subject to errors. In this paper we propose a novel approach to significantly remove geometric distortion from document images. In this method first we extract document lines from do...

متن کامل

Virtual Example Synthesis Based on PCA for Off-Line Handwritten Character Recognition

This paper proposes a method to improve off-line character classifiers learned from examples using virtual examples synthesized from an on-line character database. To obtain good classifiers, a large database which contains a large enough number of variations of handwritten characters is usually required. However, in practice, collecting enough data is time-consuming and costly. In this paper, ...

متن کامل

Contours Extraction Using Line Detection and Zernike Moment

Most of the contour detection methods suffers from some drawbacks such as noise, occlusion of objects, shifting, scaling and rotation of objects in image which they suppress the recognition accuracy. To solve the problem, this paper utilizes Zernike Moment (ZM) and Pseudo Zernike Moment (PZM) to extract object contour features in all situations such as rotation, scaling and shifting of object i...

متن کامل

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...

متن کامل

Efficient Zone Based Feature Extration Algorithm for Handwritten Numeral Recognition of Four Popular South Indian Scripts

Character recognition is the important area in image processing and pattern recognition fields. Handwritten character recognition has received extensive attention in academic and production fields. The recognition system can be either on-line or off-line. Off-line handwriting recognition is the subfield of optical character recognition. India is a multi-lingual and multi-script country, where e...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Int. J. Comput. Proc. Oriental Lang.

دوره 15  شماره 

صفحات  -

تاریخ انتشار 2002